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Methods for Silhouette Extraction 



Field of the Invention 

The invention relates to image processing, in 
particular methods of extracting a silhouette from an image. 

5 Background of the Invention 



technique in finding silhouettes in monochrome images. An edge 
is defined as a location in an image with high grey-level 
contrasts, i.e. there is a large jump in grey level from one 
10 pixel to the next . There are a number of standard algorithms 

for detecting edges based on, for example the Sobel and Laplace 
methods . 



level contrast, so that an edge is formed when the contrast is 
15 above that threshold. For the silhouette extraction problem it 
is very difficult to provide a threshold that is consistent for 
all situations. If the threshold value is too low, then edges 
will be indicated everywhere in the image, both inside the 
object for which the silhouette is being determined and in the 
20 background. In this case it will be difficult to extract the 
silhouette from the myriad of edges. / 



not be indicated along those parts of the silhouette where the 
difference in grey level between the edge of the object for 

25 which the silhouette is being determined and the background is 
small. For example in the case of a head-and-upper torso 
portrait of -a person, this may occur when the grey level of 
hair, skin or clothing is nearly the same as that of the 
background. Without clearly defining the object edges, the 

30 silhouette of the object cannot be found. 



In image processing, edge detection is the usual 



In edge detection, a threshold is set for a grey - 



If the threshold value is too high, then the edge may 
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As such existing methods rely on the grey levels and 
textures of the object and the background, as well as the 
lighting conditions at the time the image is captured and the 
exposure setting and other characteristics of the camera 

5 capturing the image. In order for these edge detection methods 
to succeed, the colour of the background must be carefully 
chosen and a sophisticated camera and lighting setup is 
required. This can be inconvenient, especially when a system 
used for imaging head-and-upper torso portraits is imaging many 

10 people with different hair colour, clothing and complexions. 



Summary of the Invention 

In comparison with the described edge detection 
methods above, the invention does not require the detection of 
edges; actually it makes no use of the contrast in grey level 
values in the image or picture being processed. Consequently, 
the invention is immune to the grey level values or textures of 
the object for which the silhouette is being determined or the 
background, and also immune to the camera and lighting setups. 
It works well even when the grey level value at an edge of the 
object is very close to that of the background. 

According to a first broad aspect of the invention, 
there is provided a method comprising: extracting the 
silhouette of an object against a fairly plain background in an 
image comprising a plurality of pixels by processing the image, 
25 the processing comprising determining if adjacent pixels of the 
image have an equal grey level value and the processing is 
independent of the numerical values of the original grey level 
values of pixels of the image. 

According to a second broad aspect of the invention, 
30 there is provided a computer readable medium having computer 
readable program code means embodied therein for extracting a 
silhouette against a fairly plain background from an image 



15 



20 
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comprising a plurality of pixels, the computer readable code 
means comprising: code means for processing the image, the 
processing comprising determining if adjacent pixels of the 
image have an equal grey level value and the processing is 
5 independent of the numerical values of the original grey level 
values of pixels of the image. 

Other aspects and features of the present invention 
will become apparent to those ordinarily skilled in the art 
upon review of the following description of specific 
10 embodiments of the invention in conjunction with the 
accompanying figures . 

Brief Description of the Drawings 

Preferred embodiments of the invention will now be 
described with reference to the attached drawings in which: 

15 Figure 1 is a flow chart of a method for extracting a 

silhouette according to an embodiment of the invention; 

Figure 2A is an example grey scale images- 
Figure 2B is the example grey scale image of Figure 

2a in which the grey levels of the pixels are transformed by an 
20 averaging operations- 
Figure 2C is the grey scale image of Figure 2B in 

which iso-grey area (IGA) regions are determined according to 

an embodiment of the invention and are marked with a dark 

border; 

25 Figure 2D is a grey scale image in which the grey 

levels of the pixels in an IGA region are equal to the number 
of pixels in the respective IGA region; 

Figure 3A is an example image for which a silhouette 
is to be extracted; 
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Figure 3B is the image of Figure 3A after initial 
processing of the image with the averaging operation; 

Figure 4 is the image of Figure 3B that has been 
processed to be an IGA image; 

5 Figure 5 is the image of Figure 4 that has been 

processed by analyzing the area outside the region currently 
determined to be silhouette; 

Figure 6 is the image of Figure 5 that has been 
processed by a dilation operation; 

10 Figure 7 is an enlarged scale version of the image of 

Figure 6 that has been processed by analyzing the area inside 
the region currently determined to be silhouette; 

Figure 8 is the image of Figure 3A with the extracted 
silhouette superimposed on it; and 

15 Figures 9A, 9B and 9C are extracted silhouettes of an 

object in an image where the silhouettes are superimposed over 
the image after a first pass of the method (Figure 9A) and 
repetitions of the method to tighten the silhouette (Figures 9B 
and 9C) ; 

20 Detailed Description of the Preferred Embodiments 

The embodiments of the present invention disclosed 
herein provide an alternative to using conventional edge 
detection techniques for extracting a silhouette of an object 
with simple boundary from an image. Methods of the invention 

25 can be used for extracting the silhouette from an image of the 
object against a fairly plain background by a single monochrome 
camera. An example of the object is a head and upper body 
portion of a person in a head- and- upper torso portrait, such as 
the type taken for a passport photo or a driver's licence. A 

30 silhouette is an outline of the object in an image. In the 
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example above the silhouette is the curve that separates the 
head -and -upper torso portrait of a person including hair, face, 
ears, neck and upper body from the background, where the hair, 
the face, and the upper body are within the boundary of the 
silhouette. More generally, the image may include any type of 
object with simple boundary in the image for which the 
silhouette is to be extracted. A ''fairly plain background" is 
meant to define that the background does not contain any 
additional objects or people and does not have any distinctive 
visual patterns, such as lines. A plain wall or backdrop of any 
color lit by ambient light will satisfy the definition of a 
''fairly plain background" , even if brightness in the image is 
not uniform due to uneven lighting. In a case of a portrait of 
a person where the person is standing in close proximity to a 
wall, a well-defined shadow of the person's head being cast on 
the wall by a spot light may produce a distinctive visual 
pattern in the image that would affect the ability of the 
method to properly obtain the accurate silhouette of the 
person . 

20 In some embodiments of the invention the method is 

expressed as computer implemented program code that is stored 
on a computer readable medium. The computer implemented program 
code can be run from the computer readable medium by a 
processing engine adapted to run such computer implemented 

25 program code. Examples of types of computer readable media are 
portable computer readable media such as 3.5" floppy disks, 
compact disc ROM media, or a more fixed location computer 
readable media such as a hard disk media storage unit in a 
desktop computer, laptop computer or central server providing 

30 memory storage or a workstation. 

An image or picture for which embodiments of the 
invention will operate on is stored in a digital format on a 
computer readable memory. The image may originate as a digital 



10 
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image from a digital camera or an analog image that has been 
digitized, for example a photo scanned by a scanner and stored 
in a computer readable memory. 

A camera that captures a monochrome image is capable 
5 of producing a single picture in multiple grey levels. A camera 
that captures a colour image can be used to provide images 
which can be used with embodiments of the invention by first 
converting the colour image into a monochrome image by 
averaging the red, green and blue components of the pixels of 
10 the image. Generally, the monochrome picture is considered to 
be a rectangular array of pixels, where each pixel has an 
associated grey level . 

A method according to an embodiment of the invention 
will now be described with respect to the flow chart of Figure 

15 1. The method starts at step 100. At step 110, the image 

obtained from a source such as the camera described above is 
input for silhouette detection. Attributes of the image that 
are provided are the number of rows and columns in the image, 
the number of grey levels in the image, and the grey level of 

20 each pixel in the image. At step 120, the grey levels of the 
pixels are initially processed with the purpose of creating 
numerous small iso-grey area (IGA) regions along the edge of 
the silhouette so that the silhouette becomes more prominent. 
At step 13 0, the IGA regions are further defined in the image, 

25 for example by modifying the actual grey level of the image in 
each respective IGA region to be a grey level equal to the 
number of pixels in that respective IGA region. In step 140, 
the area of the image outside the object associated with the 
silhouette that is being extracted, is processed. At step 150, 

30 additional processing is performed to improve the boundary of 
the silhouette with respect to the background. At step 160, the 
area of the image inside the object associated with the 
silhouette that is being extracted, is processed. At step 170, 
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a decision is to be made whether to further refine or ''tighten" 
the silhouette shape. If it is decided that the silhouette is 
to be refined, then the '"yes" path is followed and the image is 
further processed starting at step 180 and then repeating steps 
5 130-170. If it is decided that the silhouette is not to be 
refined, then the '"no" path is followed and the silhouette is 
output at step 190. The method ends at step 195. 

Not all steps in the method of Figure 1 are necessary 
to extract the silhouette. In some embodiments of the invention 
10 some of the processing steps or refining steps may not be 

performed if a less precise silhouette is suitable for a user's 
needs . 

The steps of the method will now be described in more 

detail . 

15 Process Pixels in Image 

At step 12 0, the grey levels of the pixels are 
modified with the purpose of creating numerous small iso-grey 
area (IGA) regions along the edge of the silhouette so that the 
silhouette becomes more prominent. IGA regions are groups of 

20 connected pixels having the same grey level value. There are a 
number of ways to operate on the pixels, for example using 
statistical, convolution and/or morphology techniques. Two 
particularly effective operations used in embodiments of the 
invention are calculating an average grey level for each pixel 

25 based on a group of pixels in close proximity to the respective 
pixel and calculating a median grey level for each pixel based 
on a group of pixels in close proximity to the respective 
pixel . 

In determining the average grey level of a group of 
30 pixels, for each pixel in the image, the average (that is 

arithmetic mean) grey level of 9 pixels in a 3X3 square around 
a respective pixel is determined. This determined average grey 
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level value is selected as the new grey level of that pixel . 
Since grey levels are integers, the average is rounded to the 
nearest integer value. 

In determining the median grey level of a group of 
5 pixels, for each pixel in the picture, the median grey level of 

the 9 pixels in the 3X3 square around a respective pixel is 
determined and used as the new grey level of that pixel . 

Determining the average and/or median grey level is 
performed based on the original pixel values of the image and 
10 not based on the new pixel values determined by the average 
and/or median calculation. 

Pixels along the edges of the image will have fewer 
pixels around them than make up a complete 3X3 square, but 
methods for finding the average and the median are similar to 
15 pixels with a group of 3X3 pixels around them. 

More generally, the number of pixels used in 
calculating the average or median may be greater than or less 
than the 3X3 block example described above. 

Mathematical operations used in embodiments of the 
20 invention that have been found to be preferable for processing 
the pixels of the image are 1) ''Average" only, 2) ''Median then 
Average", and 3) "Average then Median then Average". However, 
alternative forms of these or other mathematical operations 
that provide appropriate processing are to be considered within 
25 the scope of the invention. 

Figures 2A and 2B will now be described in order to 
understand how step 110 initially aids in creating small IGA 
regions. Figure 2A is an example image containing an array of 
pixels where each pixel of the image is represented by a square 
30 block and the grey level for each block is shown as a number 
inside each block. In Figure 2A, pixels generally indicated at 
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2 00 have a grey level equal to ''0" (black) on one side and 
pixels generally indicated at 205 have a grey level equal to 
"2" (nearly black) on the other. You may think of a nearly 
black object against a black background. Figure 2B is obtained 
5 after the ^'Average" operation described above is performed on 
the pixels of the image. The grey levels of some pixels, 
generally indicated at 215 are transformed to be equal to a 
grey level of ^'1" along the boundary between a region having 
pixels with a grey level equal to '"0", generally indicated at 
10 210 and a region having pixels with a grey level equal to ''2" . 
These pixels will form small IGA regions along the silhouette, 
as described in more detail below. 

One may consider that if the grey levels of the 
object and the background differ by only a grey level equal to 
15 1 instead of 2, then the above effect will not occur. Strictly 
speaking, this is true, but the chance of such a case is small. 
Furthermore, to handle such a case, in some embodiments of the 
invention the grey levels of all pixels in the image are 
multiplied by 2 before doing the ''Average" operation. 

20 Form IGA Picture 

At step 13 0, the IGA regions are formed in the image 
that has been altered by the '^Average" and/or ''Median" 
operations described above, and denoted as picture P from 
hereon. Step 13 0 generates an IGA picture Q from picture P, so 
25 that in future steps the silhouette is extracted from picture 
Q, and not from picture P. 

The IGA picture Q has the same size, that is the same 
numbers of rows and columns, as picture P. Preferably picture Q 
is also monochrome with usually, but not necessarily, the same 
30 grey level granularity as picture P. 



Picture P is first partitioned into mutually disjoint 
"4 -connected" regions such that the pixels in each region have 
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equal grey levels. These regions in picture P are called iso- 
grey regions. A subset of pixels is said to be ''4 -connected" if 
any two pixels in the subset are connected by a path that may 
or may not include other pixels in the subset, such that 
5 adjacent pixels with an equal grey level along the path are on 
the left/right of or above/below each other. In other words, if 
a pixel is considered a square, a pixel in a ''4 -connected" 
subset can only be connected to pixels on one of the four sides 
of the square. Regions that are ''8 -connected" allow for 
10 diagonal adjacency, as an ''8 -connected" region includes corner 
pixels of a 3X3 block of pixels around a given pixel, in 
addition to the 4 pixels, one on each side, surrounding the 
given pixel . 

There are many ways of partitioning picture P into 

15 iso-grey regions. In some embodiments of the invention one 

method used is called "flooding" . Essentially, it starts with a 
pixel in an iso-grey region that has not yet been modified to 
be an iso-grey region and "floods" the neighbouring unmodified 
pixels that have the same grey level. This same process is 

20 repeated recursively for the flooded pixels. The area is 

augmented each time a pixel is flooded. To cut the overhead 
cost in recursion and to prevent possible stack overflow, a 
queue data- structure is used to perform the flooding operation. 
In some embodiments of the invention, a "breadth-first search" 

25 technique is used. Each unmodified pixel is loaded into the 
queue, and the following steps are repeated until the queue 
becomes empty: pop the head of the queue, mark it as modified, 
augment the count, and push those neighbours of the pixel at 
the head of the queue with the same grey level as the pixel at 

30 the head of the queue to the tail of the queue. 

The number of pixels in each iso-grey region of 
picture P are counted. The resulting number of pixels in each 
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region is called the ''area" of that region and the region is 
called an IGA region. 

The IGA picture Q is formed by equating the grey 
level of each pixel in picture Q to the IGA value of the 
5 corresponding pixel in picture P. 

Referring to Figure 2C, the image represents the 
processed picture P of Figure 2B, with 4 distinct iso-grey 
regions identified by bold lined boundaries where the grey 
level equal to ''0" region is indicated by 210 as in Figure 2B, 

10 the grey level equal to ^'2" region is indicated by 220 as in 

Figure 2B and the grey level equal to ^^1" regions are indicated 
by two separate regions 225 and 230 instead of only 215 in 
Figure 2B. Figure 2D represents the IGA picture Q whose grey 
levels are the corresponding IGA values. A first ''4 -connected" 

15 region indicated at 235, has a grey level set to 32 as 32 

pixels each have the same grey level as shown in 210 of Figure 
2C. A second "4 -connected" region indicated at 240 has a grey 
level set to 3 0 as 3 0 pixels each have the same grey level as 
shown in 220 of Figure 2C. A third "4 -connected" region 

20 indicated at 245 has a grey level set to 8 as 8 pixels each 

have the same grey level as shown in 225 of Figure 2C. A fourth 
"4-connected" region indicated at 250 has a grey level set to 2 
as 2 pixels each have the same grey level as shown in 230 of 
Figure 2C. It is noted that the third and fourth regions have 

25 the same grey level but are not "4 -connected" in the manner 
described above. Therefore, the regions are different IGA 
regions. Based on the manner in which picture Q is created this 
further emphasizes that the method is not dependent on the grey 
level of the image in determining the silhouette. 

30 As described above, in Figure 2D the IGA values are 

used directly as grey level values in picture Q. However, 

usually the IGA values are transformed without interest in 
regions in picture Q that have very large IGA values. 
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Therefore, a user-selectable area threshold is introduced, 
which usually corresponds to the sizes of iso-grey regions in 
the background of picture P. 

In some embodiments of the invention, all pixels in 
5 picture Q whose IGA value is greater than or equal to this 

threshold are assigned a grey level of ^'white" , and all other 
pixels in picture Q will be given a proportionate grey level 
between black and light grey, for example 7/8 of the total grey 
level range. For 256 grey levels, this 7/8 portion would be 
10 from grey level 0, ^'black" , to grey level 224. Using light grey 
instead of white as the lower bound for pixels with IGA values 
smaller than the threshold may help in identifying the 
background from the object. 

For example, suppose that the original picture P has 
15 256 grey levels and it is desirable to maintain 256 grey levels 
for picture Q. Setting the area threshold to 100, any pixel in 
picture Q with an IGA value of 100 or above will be given a 
grey level of 255 (white) . Any pixel in picture Q in a given 
region with an IGA value of less than the threshold of 100 will 
20 be assigned a grey level between 0 and 224 that is proportional 
to the value between 0 and the threshold value of 100. In this 
specific example, this is a grey level equal to 32 + a*192/100, 
rounded to the nearest integer, where "a" is the number of 
pixels of the given region. More generally, any threshold value 
25 can be used in place of the value of 10 0 used in the example 
above . 

Note that after the IGA picture Q is obtained from 
picture P, picture P is no longer used. All future processing 
is based on picture Q and not on picture P. The method only 
30 checks whether adjacent pixels in picture P have the same or 
different grey levels, and uses this information to create the 
IGA picture and find the silhouette. The actual numeric grey 
level values of the pixels in P are not used beyond this point 
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in determining the silhouette. Therefore, the method is immune 
to brightness of the object, ambient lighting when the image is 
captured, setting of the camera used to capture the image and 
many other image dependent characteristics. 

5 Process Pixels External to Silhouette Object in Picture 

In step 140, the area of picture Q outside the object 
having the silhouette that is to be extracted, is processed. 
The silhouette is more apparent in the IGA picture than in the 
original picture. In order to extract the silhouette from the 

10 IGA picture, the image is analyzed using a disc shaped 

collision detector to define a coarse boundary around the 
object by analyzing the area outside the object and marking the 
coarse boundary. The disc shaped collision detector is moved 
around within an area of the image outside of the object of 

15 which the silhouette is to be determined to determine pixels 
that are not a part of the background. 

In some embodiments of the invention, the collision 
detector is a circular disc whose radius is not too large to 
get trapped in the background, and not too small so that 

20 detector enters into the interior of the object of which the 
silhouette is to be determined. In some embodiments of the 
invention the radius of the detector is mainly dictated by the 
degree of uniformity in the grey level of the background. In a 
particular embodiment, a detector disc radius of approximately 

25 6.5 times the pixel -width is an appropriate size to avoid the 
problems identified above. 

It is to be understood that the disc shaped collision 
detector is one example of a detector that is moved around the 
image external to the object. Other shapes for the detector may 
30 be utilized, such as a square for instance. 



The following is an example of how the disc shaped 
collision detector is used to process the area outside the 
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object. The detector is initially positioned on the IGA picture 
so that its centre is at a "white" pixel somewhere near a top 
edge of the picture close to the top left corner. It is then 
advanced to the pixel below it. The detector moves in all 
5 directions (except where it came from) over the course of 
processing making sure that its edge does not hit any non- 
'"white" pixel or a pixel that has already been visited. Each 
pixel that is encountered by the detector as it moves around 
the picture is identified in some appropriate manner as having 
10 been encountered, and each encountered '"white" pixel will be 

''marked" . The motion of the detector stops when it is no longer 
able to move without hitting non- "white" pixels or pixels 
identified as previously encountered. 

There are many ways of implementing the movement of 
15 the collision detector. A particular method is recursion. In 
some embodiments of the invention, to reduce overhead cost of 
the recursion method and to prevent possible stack overflow in 
the processor during implementation, a "breadth- first search" 
technique with a queue data-structure is used in the same way 
20 as in the flooding process described above. 

In a particular embodiment of the invention, for 
example to be used when processing a head-and-upper torso 
portrait image, moving the collision detector is repeated with 
the detector starting from 3 other locations, namely a "white" 

25 pixel somewhere on the top edge close to the top right corner 
(moving downwards) , a "white" pixel somewhere on the left edge 
(moving rightwards) , and a white pixel somewhere on the right 
edge (moving leftwards) . The third and fourth starting 
positions should be such that they are in the background, (e.g. 

30 above the shoulder for the person in the portrait) . 

Preferably, during each repetition, the detector does not 
traverse pixels that have been visited previously and can be so 
determined by the appropriate identification mentioned above. 
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It is to be understood that the repetitions of moving the 
detector are application specific and the number of repetitions 
and the starting position of the collision detector are 
selected as desired. Normally, one or two repetitions may be 
5 sufficient; the last few repetitions may not even start if all 
of the edges have been encountered by earlier repetitions. 

A problem known to occur with this method is, if the 
top of the silhouette object is very close to the top edge of 
the picture and the width of the gap between the silhouette 
10 object and the top edge of the picture is less than the radius 
of the collision detector, the detector cannot advance past the 
top of the silhouette object. This problem is easily solved by 
creating a '^buffer" area above the silhouette object for the 
disc to pass through. 

15 After the collision detector has completed the 

repetitions of movement outside of the object, the interior of 
the object should be left "unmarked". It may be advantageous to 
halve the grey levels of the "unmarked" pixels in the IGA 
picture, so as to distinguish them clearly from the "marked" 

20 ones. 

Process pixels in IGA picture 

Step 140 of moving the collision detector outside the 
object is used to identify the interior of the silhouette 
object (or more correctly, to identify those places that are 

25 not in the interior) . However, as a result the interior of the 
object is often defined to be slightly larger than the 
silhouette of the actual object due to the nature of the 
movement of the detector. The result can be improved by 
performing some additional processing, step 150, on the image 

30 output from step 140. For example, applying a dilation 

operation twice on the picture produces good results. More 
generally, one or more than one dilation operation may be 



FILE NO. 73616-13 

- 16 - 

applied. Dilation is a morphology operation in which the grey 
level of each pixel is replaced by the maximum grey level of 
the 9 pixels around it. The dilation operation causes a slight 
shrinkage of the object. While dilation is one specifically 
5 described operation performed at step 150, using other image 
processing techniques is considered to be within the scope of 
the invention. One such example of other image processing 
techniques is combining dilations with morphological opening or 
closing. Opening is erosion followed by dilation, and closing 
10 is dilation followed by erosion, where erosion is an operation 
that replaces the grey level of each pixel by the minimum grey 
level of the 9 pixels around it. 

Process pixels Within Silhouette Object 

Step 160 further aids in defining the silhouette of 
15 the object by analyzing the area within the coarse boundary 
established in step 140. Step 160 involves moving another 
collision detector within the interior of the silhouette 
object. The detector may be of a similar type to the disc 
shaped detector described above or it may have different 
20 parameters such as size or shape. 

The following is an example of how the disc shaped 
collision detector is used to process the area within the 
silhouette object. The detector is initially located at a 
position where it is believed that the object is located. For 

25 example, in the case of the head-and-upper torso portrait, the 
silhouette object is generally centered in the image with the 
upper body filling most of the bottom edge of the image. 
Therefore, the detector is located at a midpoint between the 
side edges of the image near the bottom edge. The detector is 

30 allowed to move freely over "non-white" pixels within the 
currently identified boundaries of the silhouette object. 
Whenever an edge of the detector encounters a ''white" pixel, 
(which should be just outside the silhouette object by virtue 
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of markers set in step 140) , the detector cannot proceed 
further, and the encountered pixel will be labelled as a 
boundary pixel of the silhouette. 

In this way, the silhouette of the silhouette object 
5 is obtained by collecting all of the labelled boundary pixels. 

The radius of the disc shaped region should not be 
set too small or it may escape into the area outside the 
silhouette object of the image. Similarly it should not be set 
too large else details in the curvature of the silhouette may 
10 be lost . 

In some embodiments of the invention, as with 
processing outside the silhouette object at step 140, breadth- 
first search with a queue data- structure is used instead of 
recursion to implement the inside the silhouette object 
15 processing. 

In a particular embodiment of the invention the 
silhouette image is output with a set of silhouette boundary 
pixels identifying the silhouette, where distances between 
adjacent boundary pixels are no less than some user-specified 

20 value d. To produce such an output, whenever the collision 

detector contacts a white pixel : a circle of radius d is drawn 
about the white pixel and every white pixel inside the circle 
(excluding the white pixel itself) is converted into a black 
pixel. In this way, it is ensured that the next contacted pixel 

25 will be at distance d from the white pixel. If desired, these 
silhouette boundary pixels can then be arranged so that they 
run along the silhouette in order. 

Output Silhouette 



The output at step 190 is an order set of silhouette 
30 boundary pixels, given by their row and column numbers. These 
pixels may be continuous or evenly spaced. 
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Figures 3 to 8 provide an example of an image 
processed using the method of Figure 1 . 

Figure 3A depicts an original picture of a mannequin 
with black hair to be operated on by the method of Figure 1. A 
5 processed picture by an ^^Average" operation in step 120 is 

shown in Figure 3B. A blurring effect can be observed along the 
silhouette, which produces the small IGA regions in step 130. 

Figure 4 shows the IGA picture Q representation for 
the image of Figure 3A resulting from step 130. The IGA picture 
10 Q in Figure 4 was generated using 100 as the area threshold. It 

can be seen that regions in the original input picture P that 
have varying grey levels, such as the face, become dark in the 
picture Q representation. 

The IGA picture of the image of Figure 3 A following 
15 step 140 is shown in Figure 5. The interior of the portrait are 
distinctively darker after halving the grey levels there. 

Figure 6 shows the effect of the pixel processing of 
step 150, specifically the dilation operation. We can see that 
this process not only affects the pixels of the silhouette 
20 object, but also the pixels in the background. 

The dotted silhouette of the original image from 
Figure 3A resulting from step 160 is shown in Figure 7. A 
larger image than that of the original is used so that the 
dotted silhouette can be seen. The black patches along the 
25 boundary of the silhouette are produced by ''painting" the 

circles around each contacted pixel black, as described above. 

Figure 8 is a visualization of the output at step 
190. It is formed from the original Figure 3A by turning the 
silhouette boundary pixels white. 



30 Process given picture further 
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In the above example, the method successfully 
generated the silhouette of the black-haired mannequin against 
a black background. If a person has fine hair that sticks out 
from their head, the method will produce a silhouette enclosing 
5 the fine pieces of hair. 

If the user wants to tighter the silhouette around 
the hairline, further processing can be performed on the image, 
as indicated in Figure 1 by decision step 170 and the yes path 
leading to step 180. In addition to the original types of 

10 processing performed at step 12 0 such as ''Average" operations 
on the given picture, further operations can also be 
performed- In some embodiments multiple iterations of the 
refining step are performed. A preferred additional operation 
for a first tightening is ''Median- then-Average" , and for a 

15 subsequent tightening a preferred operation is "Bias -then- 
Median- then- Average" . A "Bias" operation is a morphology 
operation in which for each pixel p, a pixel q is found in its 
3X3 neighbourhood with a grey level c closest to some biased 
value b, and the grey level c is assigned to p. So, if there 

20 exists a pixel in the neighbourhood with grey level less than 
or equal to b and there exists a pixel in the neighbourhood 
with grey level greater than or equal to b, then p will be 
assigned the grey level b. In some embodiments of the 
invention, for example to be used when processing the head-and- 

25 upper torso portrait, the average grey level that is used 
around the upper left and right corners as b. 

Figure 9A is a visualization, in a similar fashion to 
Figure 8, of the output for a different input image.' In this 
case the input image can be seen to be a mannequin with blonde 
30 hair that has a significant amount of portions of the hair that 
spread out in many different directions. In Figure 9A it is 
seen that the silhouette boundary pixels do not conform closely 
to the silhouette of the hair of the mannequin. Figure 9B shows 
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another visualization of the original image with a silhouette 
boundary that has been processed by repeating the method to 
tighten the silhouette boundary. Figure 9C shows a further 
visualization of the original image where the silhouette 
5 boundary has been processed by repeating the method another 
time to tighten the silhouette boundary. 

The method of Figure 1 works well in most cases. 
However, even though the method does not depend on the number 
of pixels in the picture or the number of grey levels, it may 

10 not work well for pictures with very low resolution and very 

small grey level granularity. A particular example in which the 
described method is effective is for pictures with 1024*768 
pixels and 256 grey levels. Also, as mentioned above, the 
method is not well suited for backgrounds that are far from 

15 being plain, with texture or shadows. 

Other factors that can affect the method are narrow 
and sporadic protrubances from the main object for which the 
silhouette is desired. For example, if a person has flamboyant 
hair with thin portions of hair directed away from the head, or 
20 if the person wears a long ornament like earrings. 

Furthermore, an exact representation of the 
silhouette may not be possible if some part of the person's 
hair, skin or clothing on the boundary between the silhouette 
and the background has a constant grey level that is exactly 
25 identical to the background. In the case of hair in particular, 
this is improbable because hair is made of many individual hair 
strands which usually differ by at least one grey level. 

Since the above-described factors are unlikely or can 
be avoided easily, the method is robust in most situations. 

30 The speed of the proposed method is comparable to the 

existing methods, as they roughly have the same degree of 
complexity. 
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Applications of the Algorithm 

The method as described with respect to Figure 1 was 
further illustrated by an example of an image of the head-and- 
upper torso of a person, or in actual fact a mannequin- A 
5 particular application of the method is for use in inside- 
engraving of crystals based on a portrait of a person, where 
finding the silhouette of the person is a first and essential 
step in the overall process. 

A potential use of the method is to replace blue (or 
10 green) screens in the movie industry. Blue screening is 

presently the standard method for producing special effects: an 
actor acts in front of a blue screen, and all the blue colour 
within some brightness range is replaced by a different 
background. Some limitations of the bluescreen technique are 
15 that the captured images are typically filmed in a studio with 
a specially painted blue wall and floor; it requires careful 
lighting setup; there should be no similar blue colour on the 
actor's wardrobe; and there are problems with shadows and "blue 
spills" onto the actor creating a blue tinge around the edges. 
20 Embodiments of the invention do not have similar problems. The 
shooting can be done in a studio with ordinary walls or in an 
outdoor environment . 

Embodiments of the invention can also be used in 
surveillance and security systems, where a silhouette helps in 
25 singling out a person or a face for facial recognition. 

For stereoscopic imaging of a person, the silhouettes 
obtained for the left and right images can be used to generate 
the position of the person using stereo disparity techniques on 
the silhouettes. 



30 Numerous modifications and variations of the present 

invention are possible in light of the above teachings. It is 
therefore to be understood that within the scope of the 
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appended claims, the invention may be practised otherwise than 
as specifically described herein. 



